Ego-net link prediction with GNN (in English)
Annotation
The task of link prediction is one of the key challenges in the field of social network analysis. The common way to build such systems is based on the idea of decomposing a task into two levels. At the first level, links within ego-nets are predicted; at the second, the results are aggregated to form the final predictions. The accuracy of such systems depends on the first-level model. Heuristic methods are usually used here. The focus of this work is on developing a new supervised model to improve the quality of link prediction within ego-nets. The heterogeneity of the edge attributes, the absence of node features, and the dynamic nature of ego-nets distinguish this task from others. The proposed method belongs to the class of graph neural networks. Its key feature is the ability to effectively consider the topology of the graph along with the attributes of the edges, without relying on the properties of the nodes. This effect is achieved by modeling the hidden state of node pairs, rather than the state of each node individually. The iterative nature of the model makes it possible to propagate knowledge about the relationships between nodes, increasing the complexity of the structures considered with each step. To measure the accuracy of the model, the Ego-VK dataset was used. This dataset consists of a set of ego-nets from a subsample of users of the VKontakte social network. The model is compared with the classical Adamic-Adar method as well as modern approaches based on graph neural networks. Experiments show that the proposed model is significantly superior to the baselines with respect to NDCG@5 ranking quality metric. The results demonstrate the high effectiveness of the proposed model, and the possibility of integration into distributed systems makes it widely applicable in the industry.
Keywords
Постоянный URL
Articles in current issue
- Fluorescence studies of natural photosensitizers in oncology and antimicrobial therapy
- Review of deep learning methods for imaging photoplethysmography data processing
- Effect of heat treatment on the growth and luminescence of quantum dots CsPbI3 in fluorophosphate glass
- Study of nanopipettes conductivity depending on their shape and size
- Thermal conductivity of multilayer hexagonal boron nitride nanoscrolls
- Integrated control algorithm for obstacle and singularity avoidance in a robotic manipulator
- Method of automatic generation of the informative space for identifying information security events in corporate computer networks
- Spectral-based multi-band recurrent neural networks for black-box modeling of dynamic range compressors (in English)
- Hierarchical multi-task learning for low-complexity models based on task synergy analysis
- Detection of network anomalies in the Internet of Things environment using modified statistical criteria and ensemble methods
- Automatic detection of software design patterns using a language model on transformer architecture (in English)
- Multi-task human’s psychological profile analysis based on text data using semi-supervised learning
- Modeling and optimization of information flows in electronic document management systems under information security threats
- Series-parallel architecture for the FPGA implementation of neural networks trainable in real-time using the error backpropagation algorithm
- An approach to contextual example mining for DGA domain identification using large language models
- Analysis of the effectiveness of optimizing behavioral descriptions of hardware in logic synthesizers for FPGA
- Spheroidal models of ore deposits in the framework of gravity tomography
- Prediction of maximum stresses in the shaft–insert system using a neural network
- Estimation criterion and method for optimizing the redundancy of video images in surveillance systems
- Generating spatiotemporal network load series in multi-access edge computing tasks using open data
- Application of hybrid artificial intelligence methods to practical industrial tasks under conditions of scarce training data
- Implementation and investigation of a reservoir computer based on a hardware model of three-element spiking neuron
- Analysis of a centerless control scheme for profiles of large-sized shells in the process of their shaping
- Oblivious signature based on the theory of elliptic curve isogeny